Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis
Abstract
I-vector extraction and Probabilistic Linear Discriminant Analysis (PLDA) have become the state-of-the-art configuration for speaker verification. Recently, Gaussian-PLDA has been improved by a preliminary length normalization of i-vectors. This normalization, known to increase the Gaussianity of the i-vector distribution, also improves the performance of systems based on standard Linear Discriminant Analysis (LDA) and "two-covariance model" scoring. In this paper, we propose replacing length normalization with two new techniques based on total, between- and within-speaker variance spectra. Both of these "spectral" techniques normalize the length of the i-vectors for Gaussianity, but the first adapts the i-vector representation to a speaker recognition system based on LDA and two-covariance scoring, whereas the second adapts it to a Gaussian-PLDA model. Significant performance improvements are demonstrated on the male and female telephone portions of NIST SRE 2010.
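For readers unfamiliar with the baseline that the abstract refers to, the following is a minimal sketch of plain length normalization, in which each i-vector is optionally centered and then scaled to unit Euclidean norm. It does not implement the spectral techniques proposed in the paper, whose details are not given here. The function name `length_normalize`, the `mean` and `eps` parameters, and the toy data are assumptions made for illustration only.

```python
import numpy as np

def length_normalize(ivectors, mean=None, eps=1e-12):
    """Scale each row (one i-vector per row) to unit Euclidean norm.

    `mean` is an optional vector subtracted before scaling (hypothetical
    parameter; in practice it would be estimated on training data).
    `eps` guards against division by zero for all-zero rows.
    """
    x = np.asarray(ivectors, dtype=float)
    if mean is not None:
        x = x - mean
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)

# Toy data standing in for i-vectors: 5 vectors of dimension 4
# with deliberately unequal variance per dimension.
rng = np.random.default_rng(0)
toy = rng.normal(size=(5, 4)) * np.array([3.0, 1.0, 0.5, 0.1])

normalized = length_normalize(toy, mean=toy.mean(axis=0))
row_norms = np.linalg.norm(normalized, axis=1)
```

After this step every vector lies on the unit sphere, so `row_norms` contains values equal to 1 up to floating-point error.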
Similar resources
Non-Linear I-vector Extraction for Speaker Recognition
We propose an algorithm for non-linear i-vector extraction. The algorithm is based on the manifold learning technique named Diffusion Maps (DM) and motivated by recent results that showed that the GMM supervectors reside on a low dimensional manifold. Our proposed method may further be processed using standard techniques such as Linear Discriminant Analysis (LDA), Within Class Covariance Normal...
Comparative Evaluation of Feature Normalization Techniques for Speaker Verification
This paper investigates several feature normalization techniques for use in an i-vector speaker verification system based on a mixture probabilistic linear discriminant analysis (PLDA) model. The objective of the feature normalization technique is to compensate for the effects of environmental mismatch. Here, we study short-time Gaussianization (STG), short-time mean and variance normalization ...
I–vector transformation and scaling for PLDA based speaker recognition
This paper proposes a density model transformation for speaker recognition systems based on i–vectors and Probabilistic Linear Discriminant Analysis (PLDA) classification. The PLDA model assumes that the i–vectors are distributed according to the standard normal distribution, whereas it is well known that this is not the case. Experiments have shown that i–vectors are better modeled, for exa...
Blind score normalization method for PLDA based speaker recognition
Probabilistic Linear Discriminant Analysis (PLDA) has become the state-of-the-art method for modeling the i-vector space in speaker recognition tasks. However, performance degrades if the enrollment data size differs from one speaker to another. This paper presents a solution to this problem by introducing a new PLDA scoring normalization technique. Normalization parameters are derived in a ...
i-vector Based Speaker Recognition on Short Utterances
Robust speaker verification on short utterances remains a key consideration when deploying automatic speaker recognition, as many real world applications often have access to only limited duration speech data. This paper explores how the recent technologies focused around total variability modeling behave when training and testing utterance lengths are reduced. Results are presented which provi...